From Elementary Discourse Units to Complex Ones

Author

  • Holger Schauer
Abstract

Coherence relations have usually been taken to link clauses and larger units. After arguing that some phrases can be seen as discourse units, a computational account for such phrases is presented that integrates surface-based criteria with inferential ones. This approach can be generalized to treat intra-sentential cue-phrases. Since cue-phrases are not always present, referential relations between nominal expressions are additionally used to derive a text's discourse structure.

1 Introduction

It is widely acknowledged that texts are not just collections of sentences, but have a structure of their own. There has been substantial work to account for the different phenomena of discourse structure by applying theories of coherence relations, e.g. (Mann and Thompson, 1988; Asher, 1993), among others. Coherence relations represent rich semantic linkage (like Cause¹ or Evaluation) between text segments of varying size. However, what the minimum size of text segments to relate should be is still open to debate. As common approaches argue that coherence relations relate events or situations (e.g. (Hobbs et al., 1993; Asher, 1993)) and that such events are usually introduced by means of verbs, it has become standard practice to consider clauses to be the appropriate size for elementary discourse units. It has, however, also been observed (Vander Linden and Martin, 1995; Grote et al., 1997) that sometimes phrases may serve as very condensed forms to express elaborate contents. Recently, (Schauer and Hahn, 2000) provided a more detailed analysis of when prepositional phrases (PPs) may serve as elementary discourse units.

¹ Coherence relations in this paper are basically taken from Rhetorical Structure Theory (Mann and Thompson, 1988) and will appear emphasized and Capitalized.
Cursorily viewed, the claims of another recent study stand in contrast to the idea of intra-clausal discourse units: (Schauer, 2000) examined the interplay of coreferential expressions and discourse structure and concluded that referential relations are a good indicator of the discourse structural configurations in case the units examined are entire sentences. This poses the question whether entire sentences are not the appropriate grain size for elementary discourse units. I will argue that these results, i.e. the different levels of granularity for discourse units, are not incompatible with each other. The approach used in (Schauer and Hahn, 2000) to derive the coherence relation governing a prepositional phrase neatly carries over to the computation of coherence relations signaled by sentence-internal cue-phrases. This then allows an integration with the algorithm using referential relations that was proposed in (Schauer, 2000).

2 Adjuncts as Discourse Units

The question at what size of textual expressions one should start looking for discourse units has not been sufficiently answered yet.

Similar resources

Complex discourse units and their semantics

A natural and intuitive principle concerning the organization of content in discourse is that discourse structure and rhetorical function operate at several levels of granularity at once. There are low level discourse connections between elementary discourse units (EDUs), even within a single sentence; but there are also discourse connections between larger constituents, complex discourse units...

Multimodal Discourse: In Search of Units

Human communication is inherently multimodal. In this study we focus on three channels of spoken discourse: the verbal component, prosody, and gesticulation. We address the question of units that can be identified within these components and in spoken multimodal discourse as a whole. The basic unit of the verbal channel is the clause, reporting an event or a state. A set of prosodic criteria he...

The automatic identification of discourse units in Dutch text

The identification of discourse units is an essential step in discourse parsing, the automatic construction of a discourse structure from a text. We present a rule-based algorithm to identify elementary discourse units (EDUs) in Dutch written text. Contrary to approaches that focus on the determination of segment boundaries, we identify complete discourse units, which is especially helpful for ...

Thai Rhetorical Structure Analysis

Rhetorical structure analysis (RSA) explores discourse relations among elementary discourse units (EDUs) in a text. It is very useful in many text processing tasks employing relationships among EDUs such as text understanding, summarization, and question-answering. Thai language with its distinctive linguistic characteristics requires a unique technique. This article proposes an approach for Th...

Discourse Segmentation of German Written Texts

Discourse segmentation is the division of a text into minimal discourse segments, which form the leaves in the trees that are used to represent discourse structures. A definition of elementary discourse segments in German is provided by adapting widely used segmentation principles for English minimal units, while considering punctuation, morphology, syntax, and aspects of the logical document st...

Publication date: 2000